Extreme Value Theory Based Text Binarization In Documents and Natural Scenes

نویسندگان

  • Basura Fernando
  • Jean Monnet
  • Sezer Karaoglu
  • Alain Trémeau
چکیده

This paper presents a novel image binarization method that can deal with degradations such as shadows, nonuniform illumination, low-contrast, large signal-dependent noise, smear and strain. A pre-processing procedure based on morphological operations is first applied to suppress light/dark structures connected to image border. A novel binarization concept based on difference of gamma functions is presented. Next Generalized Extreme Value Distribution (GEVD) is used to find proper threshold for binarization with a significance level. Proposed method emphasizes on region of interest (with the help of morphological operations) and generates less noisy artifacts (due to GEVD). It is much simpler than other methods and works better on degraded documents and natural scene images. Keywords-Generalized extreme value distribution; Geodesic transform morphological reconstruction; Connected opening; Text binarization

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Detecting Text in Natural Scenes Based on a Reduction of Photometric Effects: Problem of Text Detection

In this paper, we propose a novel method for detecting and segmenting text layers in complex images. This method is robust against degradations such as shadows, non-uniform illumination, low-contrast, large signaldependent noise, smear and strain. The proposed method first uses a geodesic transform based on a morphological reconstruction technique to remove dark/light structures connected to th...

متن کامل

Detecting Text in Natural Scenes Based on a Reduction of Photometric Effects: Problem of Color Invariance

In this paper, we propose a novel method for detecting and segmenting text layers in complex images. This method is robust against degradations such as shadows, non-uniform illumination, low-contrast, large signaldependent noise, smear and strain. The proposed method first uses a geodesic transform based on a morphological reconstruction technique to remove dark/light structures connected to th...

متن کامل

An Analysis of Image Binarization Techniques for Natural Scene Images

Text extraction from natural scene images is an emerging field in computer graphics. Extracted text contains important information that can be used for various purpose like vehicle number plate detection to identify the vehicle, to provide information of surrounding to visually impaired persons, preservation of information of historical documents etc. Binarization is a key process in text extra...

متن کامل

Font and Background Color Independent Text Binarization

We propose a novel method for binarization of color documents whereby the foreground text is output as black and the background as white regardless of the polarity of foreground-background shades. The method employs an edge-based connected component approach and automatically determines a threshold for each component. It has several advantages over existing binarization methods. Firstly, it can...

متن کامل

Towards Text Recognition in Natural Scene Images

In this paper, we propose a novel methodology for text detection in natural scene images. The proposed methodology is based on an efficient binarization and enhancement technique followed by a suitable connected component analysis procedure. Image binarization successfully processes natural scene images having shadows, non-uniform illumination, low contrast and large signaldependent noise. Conn...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010